Visualisation of Heterogeneous Data with the Generalised Generative Topographic Mapping
نویسندگان
چکیده
Heterogeneous and incomplete datasets are common in many real-world applications. The probabilistic nature of the Generative Topographic Mapping (GTM), which only handles complete continuous data originally, offers the ability to extend it to also visualise mixed-type and missing data as suggested in (Bishop et al., 1998a). This paper describes this generalisation of GTM and assesses the resulting model on both synthetic and real-world heterogeneous data with missing values.
منابع مشابه
Visualisation of heterogeneous data with simultaneous feature saliency using Generalised Generative Topographic Mapping
Most machine-learning algorithms are designed for datasets with features of a single type whereas very little attention has been given to datasets with mixed-type features. We recently proposed a model to handle mixed types with a probabilistic latent variable formalism. This proposed model describes the data by type-specific distributions that are conditionally independent given the latent spa...
متن کاملTopology-Preserving Mappings for Data Visualisation
We present a family of topology preserving mappings similar to the Self-Organizing Map (SOM) and the Generative Topographic Map (GTM) . These techniques can be considered as a non-linear projection from input or data space to the output or latent space (usually 2D or 3D), plus a clustering technique, that updates the centres. A common frame based on the GTM structure can be used with different ...
متن کاملVisualisation of tree-structured data through generative probabilistic modelling
We present a generative probabilistic model for the topographic mapping of tree structured data. The model is formulated as constrained mixture of hidden Markov tree models. A natural measure of likelihood arises as a cost function that guides the model fitting. We compare our approach with an existing neural-based methodology for constructing topographic maps of directed acyclic graphs. We arg...
متن کاملVisualising and Clustering Video Data
We review a new form of self-organizing map which is based on a nonlinear projection of latent points into data space, identical to that performed in the Generative Topographic Mapping (GTM) [1]. But whereas the GTM is an extension of a mixture of experts, this model is an extension of a product of experts [6]. We show visualisation and clustering results on a data set composed of video data of...
متن کاملNovel Visualisation Methods for Protein Data
Visualization of high-dimensional data has always been a challenging task. Here we discuss and propose variants of non-linear data projection methods (Generative Topographic Mapping (GTM) and GTM with simultaneous feature saliency (GTM-FS)) that are adapted to be effective on very highdimensional data. The adaptations use log space values at certain steps of the Expectation Maximization (EM) al...
متن کامل